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Abstract 

We reported here the use of T4 bacteriophage fi-glucosyltransferase (T4 (3-GT) for the facile synthesis of base J-containing 
oligodeoxyribonucleotides (ODNs). We found that the enzyme could catalyze the glucosylation of 5-hydroxymethyl-2- 
deoxyuridine (5hmU) in both single- and double-stranded ODNs, though the latter reaction occurred only when 5hmU was 
mispaired with a guanine. In addition, base J blocked moderately DNA replication, but it did not induce mutations during 
replication in human cells. 
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Introduction 

Unicellular eukaryotic kinetoplastid flagellates, such as Try- 
panosoma and Leishmania species, contain a unique modified 
base, 5-(P-D-glucosylhydroxymethyl)uracil (a.k.a. base J), in their 
nuclear DNA (Figure 1) [1,2]. Base J replaces approximately 1% 
of thymine (T) in the nuclear DNA of these species, but this 
modified base is absent in other eukaryotes, prokaryotes or viruses, 
rendering biosynthesis of J a potential therapeutic target against 
pathogenic kinetoplastids [3]. Base J is produced by oxidation of 
thymine to 5-hydroxymethyluracil and subsequent glucosylation of 
the latter modified nucleobase [3] . The oxidation step is catalyzed 
by the J-binding proteins JBP1 and JBP2, which are members of 
the TET-JBP superfamily of dioxygenases [4,5]. Along this line, 
the mammalian TET enzymes catalyze the corresponding 
oxidation of 5-methylcytosine to 5-hydroxymethylcytosine [6] as 
well as further oxidation of the latter to 5-formylcytosine and 5- 
carboxylcytosine [7,8]. 

The enzyme catalyzing the glucose transfer reaction involved in 
base J biosynthesis has yet been identified, though many 
experimental approaches including complementation in cell 
extracts and RNAi knockdown of candidate genes have been 
attempted [4]. In this vein, it is worth noting that, through 
bioinformatic analysis of biochemical pathways for DNA modifi- 
cations, Avarind et al. [9] recendy identified a putative glucosyl- 
transferase with an operonic association to a JBP-related gene in 
several phage genomes. The authors postulated that the TET/ 
JBP-associated glycosyltransferases (or TAGTs) may glycosylate 
substrates (i.e., 5-hydroxymethyluracil) generated by the JBP-like 
enzymes [9] . Interestingly, the bioinformatic analysis also revealed 
the presence of the mammalian ortholog of this glucosyltransferase 



(i.e., GREB1) [9], raising the possibility of the existence ofj analog 
in mammalian genome. 

In Leishmania, a small fraction of base J is located at 
transcription termination sites [10]. Loss of base J results in 
massive transcription read-through at these sites, suggesting that 
base J regulates RNA polymerase II (Pol II)-mediated transcription 
by stalling Pol II or specifying transcriptional termination [10]. It 
remains elusive whether base J affects other aspects of DNA 
metabolism, especially DNA replication. 

Understanding the biological function and characterizing the 
biophysical properties of base J at the molecular level necessitate 
the availability of oligodeoxyribonucleotides (ODNs) containing a 
site-specifically inserted base J. Along this line, base J-carrying 
ODNs have been previously synthesized using conventional 
phosphoramidite chemistry and automated solid-phase DNA 
synthesis [11-13]. This method generally involves multiple 
synthetic steps, and it has relatively low overall yields. Several 
efficient chemoenzymatic approaches have been developed for the 
preparation of sugar-nucleotide derivatives [14—16]. We reason 
that a similar method might be useful for the synthesis of base J- 
containing ODNs. However, so far, enzymes that can efficiendy 
catalyze this reaction have not been discovered yet. Here, we 
found that (3-glucosyltransferase of Escherichia coli T4 bacterio- 
phage (T4 fi-GT) could catalyze the glucosylation of 5hmU in 
single-stranded ODN or in double-stranded ODN when 5hmU is 
mispaired with a guanine (G). Additionally, with the use of this 
method, we were able to prepare sufficient base J-containing ODN 
for assessing how base J compromises DNA replication in cells. We 
found that, in contrast to its strong blocking effects on DNA 
transcription, base J moderately impedes DNA replication in 
human cells and it does not induce mutations during this process. 
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Materials and Methods 

Materials 

Unmodified oligodeoxyribonucleotides (ODNs) used in this 
study were purchased from Integrated DNA Technologies 
(Coralville, IA, USA). [y- 32 P]ATP was obtained from Perkin 
Elmer (Piscataway, NJ, USA). Shrimp alkaline phosphatase was 
obtained from the USB Corporation (Cleveland, OH, USA). T4 
phage P-glucosyltransferase (T4 P-GT) and all other enzymes 
unless otherwise specified were purchased from New England 
BioLabs (NEB). l,l,l,3,3,3-Hexafluoro-2-propanol (HFIP) was 
purchased from TCI America (Portland, OR, USA). Chemicals 
unless otherwise noted were obtained from Sigma-Aldrich (St. 
Louis, MO, USA). 

The 5hmU-containing ODN (5 ' -ATGGC G5hmUGCTAT-3 ') 
was synthesized following previously published procedures [17], 
and the identity of the modified ODN was confirmed by 
electrospray ionization-mass spectrometry (ESI-MS) and tandem 
MS (MS/MS) analyses [18]. We chose this particular sequence 
context because we previously conducted replication studies for a 
number of DNA lesions in the same sequence context [19-21]. 

Detection of the T4 p-GT activity on different ODN 
substrates 

The above 12 mer 5hmU-bearing ODN was annealed with a 20 
mer complementary ODN (5'-ATAGCXCGCCATGAGCTC- 
GAGA-3') ('X' is an A, T, C or G). The single-stranded 12 mer 
5hmU-bearing ODN or the annealed double-stranded ODNs (30 
pmol each) were added to a 10-L T4 (3-GT reaction buffer 
containing T4 fi-GT (5 units) and uridine diphosphate-glucose 
(UDP-Glc, 0.04 mil). The mixture was incubated at 25°C for 
30 min followed by heating at 65°C for 10 min. The above 
mixture (1 |J,L) was then incubated in a 10-L T4 polynucleotide 
kinase (T4 PNK) buffer with 5 mM DTT, ATP (50 pmol cold, 
premixed with 1.66 pmol [y- 32 P]ATP) and 5 units of T4 PNK. 
The reaction was continued at 37°C for 1 h, followed by 
quenching with 10 |jL formamide gel loading buffer containing 
xylene cyanol FF and bromophenol blue dyes. The mixture was 
loaded onto a 30% denaturing polyacrylamide gel (acrylamide:bis- 
acrylamide = 19:1) containing 8 M urea (Figure SI in File SI). 

Preparation of base J-containing ODN 

For the preparation of base J-harboring ODN at a larger scale, 
30 nmol of the 5hmU:G mismatch-containing double-stranded 



ODN, which was dissolved in a 1-mL NEB buffer 4, was mixed 
with a 50-L solution containing 2 mM UDP-Glc and 500 units T4 
P-GT. After incubation at 37°C for 2 h, 200 units T4 P-GT and 
1 0 U.L 2 mM UDP-Glc were added, and the reaction was allowed 
to proceed at 37°C for up to 6 h. The reaction mixture was 
extracted once with phenol/chloroform/isoamyl alcohol (25:24:1, 
v/v). The aqueous portion was concentrated with Speed-vac, and 
separated by HPLC. The products were purified on a Beckman 
HPLC system with pump module 125 and a UV detector (module 
126). A 4.6x250 mm Apollo C18 column (5 |J.m in particle size 
and 300 A in pore size; Alltech Associate Inc., Deerfield, IL) was 
used. HFIP buffer (400 mM, pH adjusted to 7.0 with triethyl- 
amine, solution A) and methanol (solution B) were employed as 
mobile phases. The flow rate was 0.8 mL/min, and a gradient of 
5-25% B in 5 min, 25-40% B in 50 min, and 40-80% B in 5 min 
was employed for the separation (Figure 2). The purified ODNs 
were desalted on the same HPLC system with H a O as mobile 
phase A and methanol as mobile phase B, and a gradient of 5% B 
in 20 min, 5-50% B in 1 min, and 50% B in 25 min was used. 
The identity of the base J-containing ODN was again confirmed 
by LC-MS/MS analysis (Figure 3), which was carried out using an 
Agilent 1100 capillary HPLC pump (Agilent Technologies, Palo 
Alto, CA) and an LTO_ linear ion-trap mass spectrometer (Thermo 
Electron, San Jose, CA). 

Determination of bypass efficiency using competitive 
replication and adduct bypass assay 

We first constructed the base J- and 5hmU-bearing double- 
stranded shuttle vector by using a previously described method 
(Figure S2 in File SI) [19,22,23]. Briefly, we nicked the parent 
vector with Nt.BstNBI to produce a gapped vector by removing a 
25-mer single-stranded ODN, followed by filling the gap with a 12 
mer base J- or 5hmU-bearing ODN (5 '-ATGGCGXGCTAT-3 ' , 
X = base J or 5hmU), and a 13 mer unmodified ODN (5'- 
TGTGGAGTCGATG-3'). The resulting supercoiled lesion-bear- 
ing plasmid was isolated by agarose gel electrophoresis. Using the 
same method, we prepared the lesion-free control plasmid, where 
the 1 2 mer base J-containing ODN was replaced with a 1 2 mer 
ODN (5'-ATGGCGTGCTAT-3') during the ligation reaction. 

The HEK293T cells (6x10 ) were seeded in 6-well plates and 
cultured overnight, after which they were transfected with 500 ng 
plasmids by using Lipofectamine 2000 (Invitrogen) following the 
manufacturer's instructions. The cells were harvested at 24 h 
following transfection, and the progenies of the plasmid were 



HO 




O HO 



OH 




5hmU 5hmC Base J Glc-5hmC 

Figure 1. Chemical Structures of 2 -deoxynucleosides containing 5hmU, 5hmC, base J and Glc-5hmC. 

doi:10.1371/journal.pone.0103335.g001 
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Figure 2. HPLC trace for the separation of a reaction mixture containing 12 mer base J- and 5hmU-containing ODNs. Peaks 1, 2 and 3 

represent the 12 mer base J- and 5hmU-containing ODNs and complementary ODN, respectively. 

doi:10.1371/journal.pone.0103335.g002 



isolated by using an alkali lysis method [24]. The residual 
unreplicated plasmids were removed by Dpnl digestion, followed 
by digesting the resulting linear DNA with exonuclease III as 
described elsewhere [25]. 

The progeny plasmids were subsequently amplified by PCR 
using Phusion high-fidelity DNA polymerase (New England 
Biolabs, Ipswich, MA). The primers flanking the site where the 
lesion was initially inserted were 5'-CTTTCCAAAATGTCG- 
TAACAACTCC-3' and 5'-CAACACTCAACCC- 

TATCTCGGTCTAT-3', and the amplification cycle was 36, 
each consisting of 10 s at 98°C, 30 s at 65°C, and 45 s at 72°C, 
with a final extension at 72°C for 5 min. The PCR products were 
purified by QJAquick PCR Purification Kit (Qiagen) and stored at 
-20°C until use. For PAGE analysis, a portion of the PCR 
fragments was treated with 5 units Ncol and 1 unit shrimp alkaline 
phosphatase at 37°C in 10 |0,L NEB buffer 3 for 1 h, followed by 
heating at 80°C for 20 min to deactivate the shrimp alkaline 
phosphatase. The above mixture was then treated in a 15 |jL NEB 
buffer 3 with 5 mil DTT, ATP (50 pmol cold, premixed with 1 .66 
pmol [y- 32 P] ATP) and 5 units T4 PNK. The reaction was 
continued at 37°C for 1 h, followed by heating at 65°C for 20 min 
to deactivate the T4 PNK. To the reaction mixture was 
subsequently added 5 units SfaNI, and the solution was incubated 
at 37°C for 1 h, followed by quenching with 15 |jL formamide gel 
loading buffer containing xylene cyanol FF and bromophenol blue 
dyes. The mixture was loaded onto a 30% polyacrylamide gel 
(acrylamide:bis-acrylamide = 19:1) and products quantified by 
phosphorimager analysis. The bypass efficiency was calculated 
using the following formula, %bypass = (lesion vector bottom- 
strand signal/lesion vector top-strand signal)/(control vector 
bottom-strand signal/ control vector top-strand signal). 

Identification of replication products by using LC-MS/MS 

To further identify the replication products using LC-MS/MS, 
PCR products were treated with 50 units Nco I, 50 units SfaN I 
and 20 units shrimp alkaline phosphatase in 200-(XL NEB buffer 3 
at 37°C for 4 h, followed by heating at 65°C for 20 min (Figure S3 
in File SI). The resulting solution was then extracted once with 
chloroform/isoamyl alcohol (24:1, v/v), and the aqueous portion 
was desalted by HPLC and dried with Speed-vac and then 



dissolved in H 2 0. The ODN mixture was subjected to LC-MS/ 
MS analysis on an LTQ, linear ion-trap mass spectrometer, which 
was set up for monitoring the fragmentation of the [M-3H]' 5 ions 
of the 13 mer ODNs [d(CATGGCGTGGTAT) and 
d(CATGGCGXGCTAT), where "X" designates A, T, C, or G]. 

Results and Discussion 

In nature, T4 P-GT catalyzes the transfer of a glucose residue 
from UDP-Glc to 5-hydroxymethyl-2'-deoxycytidine (5hmC) in 
double-stranded DNA, yielding 5-(P-glucosylhydroxymefhyl)-2'- 
deoxycytidine (Glc-5hmC) (Figure 1) [26]. We reasoned that this 
enzyme may also be employed for the glucosylation of 5hmU to 
yield base J in DNA. To test this, we incubated a 12 mer ODN 
containing a single 5hmU with T4 P-GT in the presence of UDP- 
Glc. Our results showed that indeed T4 fJ-GT could catalyze the 
glucosylation of 5hmU in single-stranded ODN, albeit at a very 
low yield (Figure SI in File SI). Hence, this result revealed that T4 
P-GT possesses the enzymatic activity in inducing the glucosyla- 
tion of 5hmU. T4 P-GT is known not to exhibit sequence 
specificity [26]; however, the crystal structure of T4 P-GT in 
complex with duplex DNA revealed the direct interaction between 
several amino acid residues in the protein with the G residue that 
is paired with 5hmC [27,28]. This observation prompted us to ask 
whether the 5hmU residing in a 5hmU:G mispair may serve as a 
more robust substrate for T4 P-GT than when it is paired with an 
A. To exploit this possibility, we annealed the 12 mer 5hmU- 
containing ODN with 20 mer complementary sequences in which 
the nucleobase paired with 5hmU was an A, T, C or G. In this 
vein, we chose a 20 mer complementary strand because it can be 
resolved readily from the 5hmU- or base J-containing strand by 
PAGE or HPLC analysis (Figure 2). After treating these double- 
stranded ODNs with T4 P-GT, only the duplex with the 5hmU 
being mispaired with G gives the glucosylated product, with a yield 
being much higher than that in single-stranded ODN (Figure S 1 in 
File SI, and the identity of the base J-containing ODN was 
confirmed by ESI-MS and MS/MS analyses, as depicted in 
Figure 3). The greater yield of base J obtained from duplex DNA 
substrate could be attributed to the stronger binding affinity of the 
enzyme toward double- than single-stranded DNA [29]. The 
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Figure 3. ESI-MS and MS/MS characterizations of d(ATGGCGJGCTAT) ('J' represents base J): (A) Negative-ion ESI-MS; (B) The 

product-ion spectrum of the [M-3H] 3 ~ ion [m/z 1283.8). 

doi:10.1371/journal.pone.0103335.g003 



higher reactivity observed for the 5hmU:G mispair than 5hmUA, 
5hmU:C or 5hmU:T base pair shows that the G in the opposite 
strand plays an instrumental role in binding with T4 p"-GT and in 
facilitating the glucosylation of the opposing 5hmU. 

Encouraged by the above results, we conducted the reaction at a 
larger scale by using 30 nmol of the above-described 5hmU:G 
mispair-containing 12 mer/20 mer duplex (See Materials and 
Methods), and subjected the reaction mixture to HPLC analysis. 
Our results showed that the 1 2 mer base J- and 5hmU-containing 
ODNs could be completely resolved from each other and from the 
complementary 20 mer ODN by HPLC (Figure 2). After HPLC 
separation and desalting, a total of 7.2 nmol base J-containing 
ODN was obtained at a yield of 24%. Furthermore, the unreacted 
5hmU-containing ODN could be recovered for the next round of 
reaction. T4 (5-GT does not display strong sequence specificity, 



and it can glucosylate all available 5-hmC bases in DNA [26,30]; 
therefore, the method should be applicable for the synthesis of 
base J-containing ODNs in any predefined sequences. 

T4 p"-GT was previously employed for conjugating various 
modified glucose derivatives with 5hmC in DNA [31,32]. 
Furthermore, some glucosyltransferase might catalyze the forma- 
tion of GlcNAc-5hmU in some phages (e.g. Mycobacteriophage 
Acadian) [9]. Thus, T4 P-GT may be potentially useful for 
transferring modified Glc derivatives to 5hmU in ODNs, which 
may be utilized for introducing glucose derivatives with a 
bioorthogonal handle for labeling and assessing the distribution 
of 5hmU in DNA. 

Owing to the presence of a bulky glucose ring, base J blocks 
DNA transcription [10]; however, it remains unexplored how base 
J affects DNA replication. Thus, we investigated how base J 
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Figure 4. In-vivo replication studies of base J and 5hmU in HEK293T cells. (A) Experimental procedures for determining the effects of 5hmU 
and base J on DNA replication in cells. (B) Representative PAGE gel image showing the restriction fragments. (C) The bypass efficiencies of base J and 
5hmU in HEK293T cells. The data represent the mean and standard deviation of results from three independent replication experiments. 
doi:10.1371/journal.pone.0103335.g004 
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perturbs DNA replication in mammalian cells using our recently 
developed shuttle vector method [22]. We first constructed the 
5hmU- and base J-containing double-stranded plasmids, and 
transfected them into HEK293T cells. After cellular DNA 
replication, the progeny plasmids were isolated and amplified 
with PCR. The PCR products were subsequently digested with 
restriction enzymes, and the resulting restriction fragments were 
analyzed by PAGE and LC-MS/MS for product identification 
and quantification (Figure 4A). In this respect, the negative-ion 
ESI-MS and MS/MS data identified the [M-3H] 3 " ions of non- 
mutagenic products d(CATGGCGTGCTAT) ([M-3ri] 3 ~, m/z 
1320.4) and d(C ATGGC GTGGTAT) ([M-3H] 3- , m/z 1333.1) 
(Figure S4 & S5 in File SI). The ion of m/z 1315.7 observed in 
Figure S4 in File SI could be attributed to the [M-3H] 3 " ion of 
the bottom-strand sequence with T^C mutation, i.e., 
d(CATGGCGCGCTAT), or the [M+K-4H] 3 " ion of the top- 
strand sequence, i.e., d(CACAATAGCACGC). The MS/MS 
showed that the ion of m/z 1315.7 arises from the top-strand 
sequence d(G AC AATAGC AC GC) (Figure S6 in File SI). Our 
LC-MS/MS results also revealed the absence of T^A or T^G 
mutation products, i.e., d(CATGGCGAGCTAT) or 
d(CATGGCGGGCTAT), in the restriction digestion mixture. 
Taken together, only the non-mutagenic sequences, i.e., 
d(C ATGGC GTGGTAT) and d(CATGGCGTGCTAT) could 
be detected in the digestion mixtures for samples arising from 
the in vivo replication of base J- and 5hmU-containing substrates, 
which is in line with what we found from PAGE analysis 
(Figure 4B). 

The bypass efficiencies were calculated from the ratio of the 
restriction products from the base J- or 5hmU-containing strand 
over that of the lesion-free strand (Figure 4B). Our results revealed 
that the bypass efficiencies for 5hmU and base J are approximately 
80% and 52%, respectively (Figure 4C). Thus, 5hmU and base J 
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constitute modest and moderate blocks to DNA replication 
machinery in human cells, respectively. Our result also revealed 
the absence of mutation introduced by base J during replication. 
Loss of base J could lead to cell death in Leishmania [10]. After 
DNA replication, some thymine residues in the nascent DNA 
strands must undergo rapid conversion to yield base J so that the 
levels of the modified base can be maintained. The high fidelity in 
replication across base J and its intermediate 5hmU is essential for 
the maintenance of base J levels during cell division. 

Conclusions 

In summary, we reported an enzymatic method for the facile 
synthesis of base J-containing ODNs. The method also holds 
potential for incorporating modified glucose derivatives to 5hmU 
in ODNs. In addition, we demonstrated that, in contrast to its 
strong inhibitory effects on DNA transcription, base J only 
moderately impedes DNA replication in human cells. 
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